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Abstract. In this paper we describe the performance of the WaveBurst algorithm 
which was designed for detection of gravitational wave bursts in interferometric data. 
The performance of the algorithm was evaluated on the test data set collected during 
the second LIGO Scientific run. We have measured the false alarm rate of the algorithm 
as a function of the threshold and estimated its detection efficiency for simulated burst 
waveforms. 
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1. Introduction 

A direct observation of gravitational waves (GW) produced by astrophysical sources is 
an ultimate goal for a new generation of detectors based on laser interferometry £Q-[I]. A 
broad class of expected signals are bursts which are transients of gravitational radiation 
with short duration, typically less then a second. They may be produced by sources 
such as supernova explosions, mergers of binary inspiral systems, gamma ray bursts, 
and other violent and energetic phenomena in the universe. The first result of burst 
searches with LIGO detectors has been recently reported by the LIGO collaboration 

At the present time the waveforms of gravitational waves from burst sources are 
poorly known. Consequently, burst searches employ data analysis algorithms which 
identify bursts with a broad range of possible waveforms. Recently several such 
algorithms have been developed [HI III IBJ E] , including the algorithm called WaveBurst 
[TU] . The WaveBurst algorithm is based on wavelet transformations and allows detection 
of a wide class of GW bursts by using a bank of wavelet packets [TTJ H2j . 

In this paper we describe a data analysis pipeline with a particular implementation 
of the WaveBurst algorithm and evaluate the pipeline performance on the data from 
the second LIGO Scientific run (S2). The S2 data was collected from all three LIGO 
detectors during a two month period beginning on February 14, 2003. We used a small 
fraction (~ 10%) of the S2 data, which was selected as the test (playground) set for 
the purpose of tuning and evaluation of the burst detection algorithms. The paper is 
organized as follows: we begin with a brief description of the data analysis pipeline, 
present the measurements of the false alarm rates, and conclude with the estimation of 
the pipeline sensitivity. 

2. WaveBurst data analysis pipeline 

The block diagram of the WaveBurst data analysis pipeline is shown in Figure ^ 
The pipeline takes as input the raw time series from the GW channels of the three 
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Figure 1. The block diagram of the WaveBurst data analysis pipeline for three LIGO 
detectors: HI, H2 and LI. 
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LIGO detectors and generates a set of potential GW burst events (triggers). The data 
processing stages of the pipeline are: 

• production of burst triggers with the WaveBurst event trigger generator (ETG), 

• reconstruction of events coincident in all three LIGO detectors, 

• final selection of the triple coincidence events. 

2.1. Production of burst triggers 

The WaveBurst ETG is an implementation of the burst analysis method described 
in ^Qj- The method is based on wavelet transformations which allow time-frequency 
representation of data. The bursts are identified by looking for regions in the wavelet 
domain with an excess of power inconsistent with stationary detector noise. 

The ETG takes data from the GW channels of two detectors and produces a list 
of coincident burst triggers. The data is processed in the following steps: 1) wavelet 
transformation, 2) selection of wavelet amplitudes, 3) coincidence between the channels, 
4) generation of burst triggers, and 5) selection of burst triggers. During steps 1), 2) 
and 4) the data processing is independent for each channel. During steps 3) and 5) data 
from both channels is used. 

The input data are time series with duration of 120 seconds and sampling rate of 
8192 Hz. Using the orthogonal wavelet transformation^ the time series are converted 
into wavelet series Wij, where % is the time index and j is the wavelet layer index. 
Each wavelet layer can be associated with a certain frequency band of the initial 
time series. Therefore, the wavelet series can be displayed as a time-frequency 
scalogram consisting of 64 wavelet layers with n = 15360 pixels (data samples) 
each. The time-frequency resolution of the WaveBurst scalograms is the same for 
all wavelet layers (1/128 sec x 64 Hz), which is different from dyadic time-frequency 
resolution of conventional wavelets ^01 E3J The constant time-frequency resolution 
makes the WaveBurst scalograms similar to spectrograms produced with windowed 
Fourier transformations. 

For each layer we select a fixed fraction (P) of pixels with largest absolute 
amplitudes, which are called black pixels. The number of black pixels is nP. All other 
wavelet pixels are called white pixels. Then we calculate rank statistics for the black 
pixels. The rank Rij is an integer number from 1 to nP with the rank 1 assigned to 
the pixel with the largest amplitude. Given the rank R^, the following non-parametric 
statistic is computed 



For white pixels the value of is set to zero. The statistic yij has a meaning of the pixel 
logarithmic significance. Assuming Gaussian detector noise, the logarithmic significance 

| SYMLET wavelet with the filter length of 60. 




(1) 
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can be also calculated as 




ln(P) - In ( J2/^ [°° e~ x2/2 dx J , (2) 



where Wij is the absolute value of the pixel amplitude in units of the noise standard 
deviation. In practice, the LIGO detector noise is not Gaussian and its probability 
distribution function is not well known. Therefore, we use the non-parametric statistic 
t/ij, which is a more robust measure of the pixel significance than y^. Using the inverse 
function of gp with as an argument, we introduce the non-parametric amplitude 

Wij = 9p 1 (y*j), (3) 
and the excess power ratio 

Pij = ™% - 1, (4) 

which characterizes the pixel excess power above the average detector noise. 

After the black pixels have been selected, we require a coincidence between the 
channels. Given a black pixel in the first channel, it is accepted or rejected depending 
on a condition applied to pixels in the corresponding time-frequency area of the second 
channel. The pixel is accepted if the significance of pixels in the second channel satisfies 

V(i-i)j + Vij + V(i+i)j > V, (5) 

where 77 is the coincidence threshold. Otherwise, the pixel is rejected. This procedure 
is repeated for all black pixels in the first channel. The same coincidence algorithm 
is applied to pixels in the second channel. As a result, a considerable number of black 
pixels in both channels produced by fluctuations of the detector noise is rejected. At the 
same time, black pixels produced by coincident bursts have a much higher acceptance 
probability because of the coherent excess of power in two detectors. 

After the coincidence procedure is applied to both channels, the channels may have 
similar but not identical patterns of black pixels which form clusters. The clusters 
can be reconstructed separately for each channel. However, in the present WaveBurst 
algorithm, we merge the black pixels from both channels into one time-frequency 
scalogram and then run a cluster analysis. For each black pixel we define neighbors, 
which share a side or a vertex with the black pixel. Neighbors can be both black 
and white pixels. The white neighbors are called halo pixels. We define a cluster as 
a connected group of black and halo pixels. After the cluster reconstruction, we go 
back to the original time-frequency scalograms and calculate the cluster parameters 
separately for each channel. There are always two clusters, one per channel, which form 
a WaveBurst trigger. 

The cluster parameters are calculated using black pixels only. For example, the 
cluster size k is defined as the number of black pixels. Other parameters, which 
characterize the cluster strength, are the cluster excess energy ratio p and the cluster 
logarithmic likelihood Y k : 

p= 12 Pa> Yk= 12 ya> ( 6 ) 

i,jec(k) i,jec{k) 
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where C(k) defines a set of black pixels in the cluster. The cluster size and the excess 
power ratio are used for the selection of triggers. A trigger is reported by the ETG if 
both clusters satisfy the conditions: k > and p > 6.25. 

2.2. ETG tuning 

There are two main ETG parameters: the black pixel fraction P and the coincidence 
threshold rj. The purpose of these parameters is to control the average black pixel 
occupancy 0(P,r)) of the scalograms used for the cluster reconstruction. To ensure 
robust cluster reconstruction, the occupancy should not be greater than 1%. For 
white Gaussian detector noise the functional dependence 0(P,r]) can be calculated 
analytically. We require that 0(P, rj) = 0.7%, which sets a constraint on P and rj. 

The selection of black pixels effectively sets a threshold on the wavelet amplitudes: 
Wij > (7p 1 (0). The larger the value of P - the lower the threshold. However, the black 
pixel fraction should not be greater than 31.7%, otherwise pixels with negative values 
of pij would be taken into the analysis. From the other side, with P set too small (less 
than a few percent), noise outliers due to instrumental glitches may consume the entire 
time-frequency volume available for black pixels and thus mask gravitational waves. To 
avoid saturation from the instrumental glitches, we run the analysis with P equal to 
10%. Together with the occupancy constraint above it sets the coincidence threshold of 
1.5. 

For S2 playground data and selected values of P and rj, the average trigger rate 
is approximately 6 Hz. Within a factor of two it is consistent with the false alarm 
rate expected for the white Gaussian detector noise. To reduce the ETG rates to 
manageable level (~ 1 Hz), we set an intermediate threshold on the cluster excess 
power ratio (Section 12. 1|) and postpone the final selection of the pipeline threshold for 
the post-production analysis (Section 12. 4j) . In this case, there is no need to re-run the 
ETG, which is CPU and time consuming operation, for tuning of the pipeline false alarm 
rates and sensitivity. 

2.3. Triple coincidence 

The output of the WaveBurst ETG is a set of double coincidence triggers for a selected 
interferometer pair X x Y . For three LIGO interferometers there are three possible pairs: 
LI x HI, HI x H2 and HI x LI, where HI and H2 are two Hanford interferometers and 
LI is the Livingston interferometer. To identify triple coincidence events, we require a 
time-frequency coincidence of the WaveBurst triggers generated for these three pairs. 
For time coincidence the following conditions are required 

\Tlihi — Thih2\ < T w , \Th2Li — Thih2\ < T w , \Th2L1 — Tlihi\ < T w , (7) 

where T w is the time window (T w = 20 ms) and Txy is the average center time of the 
X and Y clusters. Given the time U for individual pixels, the cluster center time is 
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calculated as 

t= E urf,/ E < (8) 

ijec(k) ijec(k) 

We also apply a loose requirement on the frequency coincidence of the WaveBurst 
triggers. First, we calculate the minimun (f m i n ) an d maximum (f max ) frequency for 
each interferometer pair X x Y 

fmin = m i n (floun flow)' fmax = max (fupi fup)i (9) 

where f\ ow and f up are the lower and upper frequency boundaries of the X and Y 
clusters. Then the trigger frequency bands are calculated as f max — f m i n for all pairs. 
For frequency coincidence, the bands of all three WaveBurst triggers are required to 
overlap. 

2.4- Final selection of triple coincidence events 

The triple coincidence events consist of 3 WaveBurst triggers or 6 clusters. The cluster 
parameters are calculated separately for each interferometer, as described in Sect ion ETT1 
For the final selection of the burst triggers we use the cluster significance 

(10) 

derived from the logarithic likelihood Yj, ^Uj. Given the significance of six clusters, we 
compute the combined significance of the triple coincidence event as 

7 _ ( 7 L\ 7 H1 7 H2 7 Ll 7 H1 7 H2 A 1 / 6 /-i-in 

— y^Lim a lihi Zj H2Li A H2Li a h\H2 A mm) > 

where Z^ Y (^xy) ^ s ^ e significance of X (Y) cluster for the X xY interferometer pair. 
Figure 121 shows the combined significance distribution for the false alarm events found 
in the S2 playground data. To control the pipeline false alarm rate, we set a threshold 
on the value of the combined significance. 

3. False alarm rate 

Assuming there is no physical correlation between the Hanford and Livingston sites, 
the false alarm (FA) rate is dominated by accidental triple-coincidence events produced 
by fluctuations of the detector noise. For estimation of the FA rate we perform the 
entire WaveBurst analysis on the data with artificial time shifts between the Livingston 
and Hanford detectors. The false alarm triggers are generated for 46 time lags between 
— 115 and 115 seconds with time step of 5 seconds. The zero time lag is excluded from 
the analysis. Figure El shows the measured rates at different time lags before the final 
selection cut is applied to the combined significance. The measurements at different 
time lags are consistent with the average rate of 16.9 Hz. 

As shown in Figure EJ the FA rate depends strongly on the WaveBurst combined 
significance cut. Without compromising much the pipeline sensitivity, the significance 
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Figure 2. The combined significance distribution of the WaveBurst triggers produced 
with unphysical time lags between the Livingston and Hanford detectors. 



20r 



1 1 ^mm^t^ . ., 



14! 

12; 
10: 

8_ 
6 : 
4; 
2 



-100 



X 2 /ndf 43.59/45 
pO 16.91 ±0.07779 



-50 



50 100 
lag time, sec 



Figure 3. The false alarm rate before the Zq selection cut as a function of the time 
lag between the Livingston and Hanford detectors. 



threshold is set to 5.47 (or Iu(Zg) > 1.7). At this threshold the FA rate is approximately 
230 /iHz for the WaveBurst triggers in the frequency band 64 — 4096 Hz. The rate is 
strongly dependent on the frequency band selected for the analysis. For example, for the 
frequency band below 1100 Hz, the measured FA rate is approximately 15 yuHz. Both 
the FA rate and the pipeline sensitivity depend on the significance threshold. Varying 
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Figure 4. The false alarm rate as a function of the threshold on the combined 
significance. 



the threshold, we can study the dependence of the FA rate on the sensitivity, which is an 
important characteristic of the pipeline. Figure El shows the pipeline rate as a function 
of the sensitivity for frequency band below 1100 Hz. 

4. Simulation 

For estimation of the WaveBurst detection efficiency we studied the response of the 
analysis pipeline to simulated signals. Simulated signals with different amplitudes 
were injected into the GW data streams from three LIGO detectors. Then we applied 
the WaveBurst algorithm to find the injected signals and thus estimated the pipeline 
sensitivity. 

Several ad-hoc and astrophysically motivated waveforms were selected for 
injections. These included Gaussian, sine-Gaussian waveforms and the simulated BH- 
BH merger waveforms described in [Tl|. To make the injections as realistic as possible 
we took into account the antenna pattern functions of the LIGO detectors and injected 
the waveforms at random times, accounting for the delay in the arrival time between 
the Hanford and Livingston sites. 

4-1. Simulation Procedure 

A general GW burst is comprised of two waveforms h + (t) and h x (t) which represent 
two polarizations of the gravitational wave. The signal produced at the output of the 
GW detector is a linear combination of these waveforms: 



h{t)=F + h + (t) + F x h x (t), 



(12) 




0.35 0.4 0.45 0.5 0.55 0.6 0.65 0.7 0.75 

hrss, [strain/sqrtHz] 

Figure 5. The dependence of the false alarm rate on the pipeline sensitivity for 
sine-Gaussian waveforms with / = 235 Hz and Q = 9. Solid circles correspond to 
the WaveBurst triggers with ln(Z G ) greater than 0, 0.5, 1.0, 1.25, 1.5, 1.7, 2.0, 2.5 
(counting from the left). For convenience, the rate-hrss dependence is approximated 
with a solid line. 

where F + and F x are the antenna pattern functions. These functions depend on the 
source location in the sky (spherical angles 9 and 0) and its polarization angle ip. To 
generate the antenna pattern functions for Hanford detectors, we explicitly construct 
the rotational transformation from the source coordinate frame to the Hanford frame. 
To obtain the LI antenna pattern we apply a second transformation - rotation from the 
Hanford to the Livingston detector frame. 

The amplitudes of injected signals are varied to obtain the sensitivity of the 
algorithm as a function of the injection strength. For the Hanford data, the simulated 
signals are injected randomly in time at the average rate of 5 per minute. For the 
Livingston data, the same set of waveforms is injected with time delays, uniquely defined 
by the source coordinates 9 and 0. The source coordinates are generated randomly, 
so the sources appear distributed uniformly on the sky (all sky simulation) and their 
polarization angles take random values between and 2ir. In addition, we injected 
waveforms with the same strength for both sites, ignoring the antenna pattern functions. 
Although this simulation does not correspond to any meaningful source population, it 
allows us to remove the contribution of the antenna pattern functions and estimate the 
best pipeline sensitivity. Below we refer to this the simulation with optimal 

orientation. 
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4-2. Pipeline sensitivity 

The detection efficiency is a function of the injected signal strength. For a given signal 
strength, the detection efficiency is defined as the ratio of the number of detected 
waveforms to the total number of injected waveforms. We define the strength of an 
arbitrary burst signal as a root-sum-square strain amplitude 

^rss = {/ [h 2 + (t) + h 2 x (t)] rft| V2 . (13) 

For example, the detection efficiency curve as a function of h ISS is shown in Figure El for 
one of the simulated signals. We determine the strength of signals detected with 50% 
efficiency (h 50 %) and use it as a measure of the WaveBurst sensitivity. The results on 
the measured sensitivities are presented below for different injected waveforms and the 
combined significance threshold of 1.7. 




Figure 6. The detection efficiency for sine-Gaussian waveforms (/ = 235 Hz, Q = 9, 
optimal orientation). 



The first class of injected waveforms corresponds to the GW bursts, which can 
be produced in the final plunge of two coalescing black holes. Both h + (t) and h x (t) 
waveforms have been obtained by the Lazarus group as the result of numerical 

simulation of the merger and ringdown phases of a binary black hole system. These 
waveforms are parametrized by the total mass of the binary system in units of Solar 
mass M . The results on the pipeline sensitivity are summarized in Tabled 

Another class of injected waveforms is the sine- Gaussian wave jH] with the center 
frequency / and the quality factor Q = y/2nfr, where r defines the signal duration. 
The results of the pipeline sensitivity to the sine-Gaussian waveforms are summarized 
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Table 1. WaveBurst sensitivity (in units of 10 21 strain/ VTlz) for black hole 
waveforms (all sky simulation). 

in Table 121 assuming only one polarization (h + ) of the injected signals. The sensitivity 
is greatest to the sine-Gaussian waveform with / = 235 Hz, which lies in the frequency 
band with maximum detector sensitivity. Note that the pipeline has approximately the 
same sensitivity for two different types of sine-Gaussian waveforms: Q = 3 and Q — 9. 
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Table 2. WaveBurst sensitivity (in units of 10 21 strain/ VHz) for sine-Gaussian 
waveforms (optimal orientation and all sky simulations). 

Finally, we estimated the WaveBurst sensitivity to the pure Gaussian waveforms 
jS], which are characterized by the duration r only. For these signals one polarization 
(h + ) and optimal orientation are assumed. The results are summarized in Table El 
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Table 3. WaveBurst sensitivity (in units of 10 21 strain/ 'VHz) for Gaussian 
waveforms (optimal orientation). 



5. Conclusion 

WaveBurst is a novel method for detection of gravitational wave bursts. It works in 
the wavelet domain and allows detection of a wide class of GW bursts by using a 
large bank of wavelet packets. Using the S2 LIGO playground data we evaluated the 
performance of the WaveBurst data analysis pipeline. The pipeline sensitivity is limited 
by thresholds on: the wavelet amplitudes (defined by the ETG parameters P and 77), 
the cluster excess power ratio, and the combined significance of the triple coincidence 
events. The maximum pipeline sensitivity is approximately 4 x 10~ 21 strain /y/Hz for 
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sine-Gaussian signals corresponding to optimal orientation of sources with respect to the 
LIGO detectors. Averaged over the entire sky, the pipeline strain sensitivity becomes 
14 x 10~ 21 strain/ VHz. The false alarm rate of the pipeline is dominated by accidental 
triple coincidence events produced by fluctuations in the detector noise. Using time shift 
analysis we estimated the WaveBurst false alarm rate as a function of the combined 
significance threshold. For the threshold of 1.7, the false alarm rates are 15 /xHz and 
230 yuHz for frequency bands below 1100 Hz and 4096 Hz respectively. Thus we have 
shown that the WaveBurst algorithm has low false alarm rates and high sensitivity to 
simulated burst waveforms. 
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